Reinforcement theory

Results: 290



#Item
11Bias in Natural Actor-Critic Algorithms  Philip S. Thomas  Department of Computer Science, University of Massachusetts, Amherst, MAUSA

Bias in Natural Actor-Critic Algorithms Philip S. Thomas Department of Computer Science, University of Massachusetts, Amherst, MAUSA

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2012-10-01 18:27:53
12POVERTY AND SELF-CONTROL B. Douglas Bernheim Stanford University and NBER Debraj Ray New York University

POVERTY AND SELF-CONTROL B. Douglas Bernheim Stanford University and NBER Debraj Ray New York University

Add to Reading List

Source URL: thred.devecon.org

Language: English - Date: 2014-06-28 16:47:12
13Verification of Markov Decision Processes using Learning Algorithms? Tom´asˇ Br´azdil1 , Krishnendu Chatterjee2 , Martin Chmel´ık2 , Vojtˇech Forejt3 , Jan Kˇret´ınsk´y2 , Marta Kwiatkowska3 , David Parker4 , a

Verification of Markov Decision Processes using Learning Algorithms? Tom´asˇ Br´azdil1 , Krishnendu Chatterjee2 , Martin Chmel´ık2 , Vojtˇech Forejt3 , Jan Kˇret´ınsk´y2 , Marta Kwiatkowska3 , David Parker4 , a

Add to Reading List

Source URL: www.hieratic.eu

Language: English
14PREDICTING WHEN TO LAUGH WITH STRUCTURED CLASSIFICATION Bilal Piot1 , Olivier Pietquin2 , Matthieu Geist1 1 SUPELEC IMS-MaLIS research group and UMIGeorgiaTech - CNRS) 2

PREDICTING WHEN TO LAUGH WITH STRUCTURED CLASSIFICATION Bilal Piot1 , Olivier Pietquin2 , Matthieu Geist1 1 SUPELEC IMS-MaLIS research group and UMIGeorgiaTech - CNRS) 2

Add to Reading List

Source URL: www.metz.supelec.fr

Language: English - Date: 2014-07-15 03:12:51
15Bandits all the way down: UCB1 as a simulation policy in Monte Carlo Tree Search Edward J. Powley, Daniel Whitehouse, and Peter I. Cowling Department of Computer Science York Centre for Complex Systems Analysis Universit

Bandits all the way down: UCB1 as a simulation policy in Monte Carlo Tree Search Edward J. Powley, Daniel Whitehouse, and Peter I. Cowling Department of Computer Science York Centre for Complex Systems Analysis Universit

Add to Reading List

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
16Coordination in Multiagent Reinforcement Learning: A Bayesian Approach Georgios Chalkiadakis Craig Boutilier

Coordination in Multiagent Reinforcement Learning: A Bayesian Approach Georgios Chalkiadakis Craig Boutilier

Add to Reading List

Source URL: www.intelligence.tuc.gr

Language: English - Date: 2009-03-02 16:24:03
17RAAM: The Benefits of Robustness in Approximating Aggregated MDPs in Reinforcement Learning Dharmashankar Subramanian IBM T. J. Watson Research Center Yorktown Heights, NY 10598

RAAM: The Benefits of Robustness in Approximating Aggregated MDPs in Reinforcement Learning Dharmashankar Subramanian IBM T. J. Watson Research Center Yorktown Heights, NY 10598

Add to Reading List

Source URL: marek.petrik.us

Language: English - Date: 2016-07-14 09:59:52
18Journal of Artificial Intelligence Research  Submitted 3/13; publishedA Survey of Multi-Objective Sequential Decision-Making Diederik M. Roijers

Journal of Artificial Intelligence Research Submitted 3/13; publishedA Survey of Multi-Objective Sequential Decision-Making Diederik M. Roijers

Add to Reading List

Source URL: arxiv.org

Language: English - Date: 2014-02-04 20:03:22
19Inverse Reinforcement Learning for Interactive Systems∗ [Extended Abstract] Olivier Pietquin SUPELEC - UMIGeorgiaTech-CNRS) 2 rue Edouard BelinMetz - France

Inverse Reinforcement Learning for Interactive Systems∗ [Extended Abstract] Olivier Pietquin SUPELEC - UMIGeorgiaTech-CNRS) 2 rue Edouard BelinMetz - France

Add to Reading List

Source URL: www.ilhaire.eu

Language: English - Date: 2013-10-03 05:33:46
201  On Stochastic Feedback Control for Multi-antenna Beamforming: Formulation and Low-Complexity Algorithms Sun Sun, Min Dong, and Ben Liang

1 On Stochastic Feedback Control for Multi-antenna Beamforming: Formulation and Low-Complexity Algorithms Sun Sun, Min Dong, and Ben Liang

Add to Reading List

Source URL: www.comm.utoronto.ca

Language: English - Date: 2014-05-05 14:44:36